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ABSTRACT 

^ component analytic method for analyzing 
multivariate longitudinal deta is presented that does not make strong 
assumptions about the structure of the data. Central to the method 
are the facts that components are derived as linear composites of the 
observed or manifest variables and that the components must provide 
an adequate representation of the observed variables. Specifically, 
the components are derived so as to minimize the sum of squared 
errors in the linear regression of observed component variables. 
Although no structural assumptions are required, the method derives 
components under a variety of stationary constraints. An advantage of 
the method for researchers studying change at the individual level is 
that scores on the derived components are uniquely calculable and 
have clear properties. The method might, therefore, be usefully 
applied as a precursor to the application of strong moiels for change 
in the component scores at the individual level. The method can 
easily be generalized to encompass data measured longitudinally m 
multiple groups. A program has been written in FORTRAN to perform the 
method's analysis, although the program is not "exportable" at this 
time. An example of the application of the component analysis method 
is presented. (TJH) 
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A number ol researchers have proposed extensions of the common 
factor model to treat longitudinal multivariate data, Cattell (1963), 
Corballis and Traub (1970), Hakstian (1973), Joreskog and Sorbom (1977), 
McDonald (1984:, Swaninathan (1984), and Tiaak (1984) being a few 
examples. A different factor analytic tradition that is also applicable 
to such data is represented by the work in three-mode factor analysis 
begun by Tucker (1963, 1964, 1966) and continued by Kroonenburg and De 
Leeuw (1980), sands and Young (1980), and Harshman and Berenbaum (1981), 
amons others. Finally, we have more recent developments, whose rooU 
lie in earlier work by Tucker (1958) and Rao (1958), that combine factor 
analytic methods with models of individual growth curves. Meredith and 
Tisak (1984) and McArdle (1986) are actively developing these methods, 
and Molenaar's (1985) work in dynamic factor analysis is similar in 
spirit. 

If these diverse methods have anything in common, It is that they 
begin by assuming rather strong models, at least with respect to the 
covariance structure in the data. For example, the common factor 
approach assumes the common factor model at each occasion, typically 
with an identical number of factors at each occasion. Further 
assumptions are made concerning the covariance structure of the unique 
factors over occasions. With additional distributional assumptions, 
large sample tests of fit are available. If the basic model provides an 
adequate fit, we can proceed to test more restrictive models in a nested 
fashion. These further restrictions u.»ually involve stationarlty 
constraints on portions of the model, such as the factor pattern 
matrices. The ultimate goal is usually to discover which aspects of the 
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mode] can iDe taken as stationary over occasions, leadin? to conclusions 
about stability in factor structure. 

In what follows, I will present a component analytic method for 
analysing multivariate longitudinal data that docs not make strong 
assumptions about the structure of the data. Although no structural 
assumptions are required, the method derives components under a variety 
of stationarity constraints as explained below. An advantage of the 
method for researchers studying change at the individual level is that 
scores on the derived components are uniquely calculable and have clear 
properties. The method might therefore be usefully applied as a 
precursor to the application of strong models for change in the 
^mponent scores at the individual level. The method can easily be 
generalized to encompass data measured longitudinally in multiple 
groups, although I will not explore this extension in this oaper. 

The ''assumption free" nature of the component method to be 
presented can be a virtue or a vice, depending upon our timidity in 
making assumptions about the data at hand. I share the sentiments 
expressed by Rogosa and others (Rogosa, Brandt, & Zlmows!;;i, 1962; Rogosa 
& Willett, 1965) concerning the need for strong models of change at the 
individual level. When we are unable or unwilling to formulate such 
models however, the method to be presented is a useful data-analytic 
strategy. 

The Component Hethod 
The view of component analysis which underlies the method proposed 
here was given in a recent paper by Meredith & Millsap (1965). A key 
idea is that components are derived as linear composites of the observed 
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or manifeat variables, and that the components must provide an adequate 
representation ol the observed variables. Specifically, the components 
are derived so as to minimize the sum ol squared errors in the Unear 
regression ol observed on component variables- Let X be an nxl vector 
ol observable random variables and deline a mxl vector ol component 
score variables Z = W X, where W is an nxm matrix ol compositing 
weights. Assume €(X) = 0 and c(XX' ) = E. The paper by 
Meredith and Hillsap demonstrated that all forms ol component analysis, 
weighted or unweighted, can be formulated in terms ol linding the 
compositing matrix W which maximizes the lunction 

F(W;2;g) = triw'sGSEWcw'ao'^J. (i) 
The matrix G is an nxn nonslngular weightin<T matrix that can be used to 
dillerentially weight the elements ol X in deriving the components, as 
discussed lor example in Mulaik (1972). The matrix G might be used to 
adjust the metric ol X, or to weight the elements ol X in proportion to 
their reliability. There are many other possibilities. 

Xiis multiple- occasion case . Consider a more general lunction 
similar to F in (l) 

H(U:A,B) = tr{u'AU(u'BU)-l} (2) 
with A and B both qxq matrices, and U a qxs matrix. Let V be a qaxl 
supervector containing the s colunns ol U in concatenated lorm, 

beginning with the lirst coltian. Thus 11 U = Cuj, Ug n^j, then 

/ / / / 
V = Cui, ug U5I . Now let V be subject to r linear constraints 

ol the lorm 

c'v > H (3) 
with C an qsxr matrix and H an rxl vector, both chosen a priori. Some 



5 



01 the r equations in (3) may be equalities. In what follows, we will 
assume that we have an algorithm for finding a matrix U that maximizes H 
in (2), subject to the constraints in (3), for given matrices A and B. 
Clearly, the function F in (1) is a special case of H in (2) in which XJ 
is unconstrained. A = SGS. and B = S. The constraints in 
(3) could be used to set upper and^or lower bounds on the elements of U. 
to fix various elements of U to selected values, or to equate elements 
of U. More complex linear constraints are possible as well. 

To return to the multiple-occasion case now let X be a 
supervector composed of n random variables observable on each of p 



occasions: 



XpL 



Let e(XD 



and 



€(XX ) = E, a supermatrix of the fcrm 



S = 



Sll ... Zip 
?21 ?22 ... 2^ 



(4) 



^1 ^2 • • • ^p 

We can define an mxl vector of component scores at the 1th occasion as 
Z< = W^Xj. with Wj an nxm conpositing weight matrix to 
be applied to the observablle variables Xj. Let be an npxmp 
block diagonal matrix whose diagonal submatrices are the matrices Wj 



nw = 



Wi 0 0 ...0 
0 0 0. .0 



(5) 



0 0 0 «^ 

We can now write the supervector of component scores Z as 

Z - CZ^. Zg Zpl, with Z = D^X. It also follows 

that eC3z') = D^av and €(Xz') = TD^. 

As in the single occasion case, we want to choose the compositing 
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weights so that the resulting components provide an adequate 
representation of the observed variables. Consider the linear 
regression of observed on component variables 

€(X|2) = PZ = X (6) 
with ? an npxmp component pattern matrix. At this point we must make a 
choice. We can regress each Xj on all elements of Z, or we can 
restrict the regression to be "within occasion", regressing Xj on 
Zi- I will term the first alternative the "full information" case, 
since we use all component information in the regression. I will term 
the second alternaUve the "limited information" case, because only 
•withln-occasion" information is utilized. This choice is significant 
because the solutions for in the two cases need not be identical. 

The full information case . In this case we take P to be a full 
iLatrix. and we can consider the matrix P which will minimize the 
weighted risk function 

Rp = €t(X - x/g(X - h) - cCtr«G(X - X)(X - X))l (7) 
In (7), G is a matrix of weights to be applied in the regression of X on 
Z, and has a role analogous to G in the single occasion case in (1). 
The matrix P which minimizes Rp is 

P = n^jx^y^ (8) 
Clearly P is a function of D^. and we can choose to minimize 
Rp or .0 maximize 

H(i\;:aas;E) = trtriasro^ivajw)-!) (9) 

The algorithm for solving H in (2) can utilize the linear 
constraints In (3) to force TS to have a block diagonal form D,,. 
Further constraints can be used to identify and to impose 
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stationarity constraints over occasions. In the extreme case, we might 
require = W lor all i, resulting in stationary compositing 
weights. Less restrictive forms ol stationarity can he required, 
equating selected elements ol the compositing matrices across 
occasions. We can also place lower bounds on the compositing weights, 
lorcing all weights to be nonr ^gative, for example. 

In all cases, the ellect ol imposing additional constraints can be 
judged by comparing the resulting value ol Rp in (7) to it's value 
under a "Just- identified" model. II the increase in Rp produced by 
imposing stationarity constraints is acceptably small, we can proceed 
with stationary components. 

The limited information case. The limited inlormation case can 
also be treated in terms of a function of the form in (2). We begin by 
Uking the pattern matrix in (6) to be block diagonal, with nxm block 
diagonal matrices being the regression weights in the ith 
occasion 



P = 



Pi 0 0 ... 0 
0 Pg 0 0 ..0 



0 0 0 Pp 



(10) 



The risk function sicplifiea in this case (restricting G to be block 
diagonal) to 



A A / 



RL = E tr<Gi(Xi-Xi)(Xi-Xi) } (11) 
1 

The pattern matrices Pj which minimize Rl are ol the lorn 

Pl= 2iiWi(WiZiiWi)-l (12) 
and again we can choose the to minimize Rl. Without further 
constraints, the solution here is simply the usual "wi thin-occasion" 
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principal component solution. 

We can formulate the problem in terras of a function H by defining 
Dq to be the block diagonal weighting matrix, and tne matrix Dg 
to be a block diagonal matrix 



% = 



Sii 0 0 ... 0 
0 2^2 0 0 . .0 



(12) 



0 0 ^P 

Then R^^ is minimized by choosing to oiaximize 

HdVlDsDcPs.IV) = ^iW>^kP^^iip^r^} (13) 
We can again use our algorithm to impose constraints on D^. 
All ol the constraints considered in the lull information case are 
applicable here. The effect of imposing various constraints can again 
be evaluated by comparing the resulting value of Rl to the value 
given by the "Just-identified* principal component solution. 

Discussion 

In general, the full information solution will provide a better 
approximation to X in that Rl > Rp. However, the limited 
information solution is closer in spirit to the traditional principal 
component solution. Furthermore, a solution may not exist in the full 
Information case if ^.he components are highly correlated across 
occasions. Both solutions yield component scores which are correlated, 
both within and actoss occasions. 

The foregoing development has focused upon the compositing weight 
matrix D^. the weights used to derive the component scores from the 
observed variables. But we need not restrict our attention to the 
compositing weights, as either (9) or (13) can be written in terms of 
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th€ component pattern or component structure matrices. In the full 
information case, the pattern matrix P is sriven in (6), and the 
structure watrix is R = ZD^. We can write H in (9) in terms of 
either P or R 

H(P;G,2rl) = trfP'^GPCPVip)-^) (14) 
H(R;G.S"1^) = tr{R^CR(RVlR)-l) (15) 
Similarly, in the limited information case, we can express the block 
diagonal pattern and structure matrices as 0^ = DgD^^EgDsDi^^)"!' 
and Dr z DsDyf. The Junction H in (13) can be written in 
either of the following ways 

H(D^,;IV3,Dg-l) = tr{iy\^)^(D^-lD^)-l) (16) 
H(Dr:Og.DS"^) = tr{pRl^(DiDs-il)R)-lj (17) 
The point of this development is that we may apply constraints of the 
form in (3) to the component pattern or structure matrices, using our 
algorithm to maximize H. For example, we can impose stationarity 
constraints on the pattern or structi^re matrices. Naturally, we might 
employ such constraints to achieve sir^iple structure as well. 

To conclude, the component analysis method presented in this paper 
provides a flexible alternative to currently available approaches for 
the analysis of multivariate longitudinal data. I have written a 
program in FORTRAN to perform the analysis, although the program is not 
Exportable" at this time. An example of an application of the 
component method in the limited information case will appear this year 
in Psychology and Aging (Haan, Hillsap, & Hartka, 1986). 
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